Dataset statistics
| Number of variables | 28 |
|---|---|
| Number of observations | 114144 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 1683 |
| Duplicate rows (%) | 1.5% |
| Total size in memory | 19.8 MiB |
| Average record size in memory | 182.0 B |
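The percentages and the average record size in the table above follow from the raw counts. A minimal sketch of that arithmetic, using only the figures reported above; the variable names are illustrative, and the total size is taken at its rounded value of 19.8 MiB, so the record size agrees only approximately with the reported 182.0 B:

```python
# Figures copied from the dataset statistics table above.
n_rows = 114144
n_duplicates = 1683
total_size_mib = 19.8  # rounded in the report

# Share of duplicate rows, in percent.
duplicate_pct = 100 * n_duplicates / n_rows

# Average bytes per record: total bytes divided by the number of rows.
avg_record_bytes = total_size_mib * 1024 ** 2 / n_rows

print(f"{duplicate_pct:.1f}%")
print(f"{avg_record_bytes:.1f} B")
```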
Variable types
| Numeric | 11 |
|---|---|
| Categorical | 11 |
| Boolean | 6 |
| Warning | Type |
|---|---|
| Reason has constant value "RA" | Constant |
| Is_year_start has constant value "False" | Constant |
| Dataset has 1683 (1.5%) duplicate rows | Duplicates |
| Id has a high cardinality: 19353 distinct values | High cardinality |
| Applied is highly correlated with Received and 2 other fields | High correlation |
| Received is highly correlated with Applied and 2 other fields | High correlation |
| logapplied is highly correlated with Applied and 2 other fields | High correlation |
| logreceived is highly correlated with Applied and 2 other fields | High correlation |
| Year is highly correlated with Elapsed | High correlation |
| Month is highly correlated with Week and 1 other field | High correlation |
| Week is highly correlated with Month and 1 other field | High correlation |
| Dayofyear is highly correlated with Month and 1 other field | High correlation |
| Elapsed is highly correlated with Year | High correlation |
| Year is highly correlated with Is_year_start and 1 other field | High correlation |
| Payment_Type is highly correlated with Payment_Method and 2 other fields | High correlation |
| Is_quarter_end is highly correlated with Is_year_start and 1 other field | High correlation |
| Is_year_end is highly correlated with Is_year_start and 1 other field | High correlation |
| Age is highly correlated with AgeGroup and 2 other fields | High correlation |
| Area is highly correlated with Is_year_start and 1 other field | High correlation |
| Gender is highly correlated with Is_year_start and 1 other field | High correlation |
| Is_quarter_start is highly correlated with Is_year_start and 1 other field | High correlation |
| Payment_Method is highly correlated with Payment_Type and 2 other fields | High correlation |
| AgeGroup is highly correlated with Age and 2 other fields | High correlation |
| Is_year_start is highly correlated with Year and 14 other fields | High correlation |
| Location is highly correlated with Is_year_start and 1 other field | High correlation |
| Is_month_end is highly correlated with Is_year_start and 1 other field | High correlation |
| Reason is highly correlated with Year and 14 other fields | High correlation |
| Is_month_start is highly correlated with Is_year_start and 1 other field | High correlation |
| True_False is highly correlated with Is_year_start and 1 other field | High correlation |
| Ratio is highly skewed (γ1 = 337.8511828) | Skewed |
| Dayofweek has 21644 (19.0%) zeros | Zeros |
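The "Constant" and "Duplicates" warnings above can be reproduced with very little logic. The following is a minimal sketch on toy rows, not the profiled data; the helper names `constant_columns` and `duplicate_row_count` are hypothetical, and counting every repeat after the first occurrence is one common definition of a duplicate row (that the report counts them the same way is an assumption):

```python
from collections import Counter

def constant_columns(columns, rows):
    """Names of columns whose values are identical in every row."""
    return [
        name
        for index, name in enumerate(columns)
        if len({row[index] for row in rows}) == 1
    ]

def duplicate_row_count(rows):
    """Number of rows that repeat an earlier row (first occurrences excluded)."""
    return sum(count - 1 for count in Counter(rows).values())

# Toy data, not taken from the profiled dataset.
columns = ("Id", "Reason", "Gender")
rows = [
    ("A", "RA", "F"),
    ("B", "RA", "M"),
    ("A", "RA", "F"),  # repeats the first row
    ("C", "RA", "F"),
]

print(constant_columns(columns, rows))  # ['Reason']
print(duplicate_row_count(rows))        # 1
```

A constant column such as Reason carries no information for modelling, which is why it is usually dropped before further analysis.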
Reproduction
| Analysis started | 2021-04-26 11:24:28.256103 |
|---|---|
| Analysis finished | 2021-04-26 11:26:23.646696 |
| Duration | 1 minute and 55.39 seconds |
| Software version | pandas-profiling v2.11.0 |
| Distinct | 1197 |
|---|---|
| Distinct (%) | 1.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 382.8623844 |
|---|---|
| Minimum | 0.001 |
| Maximum | 12000 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 891.9 KiB |
Quantile statistics
| Minimum | 0.001 |
|---|---|
| 5-th percentile | 106 |
| Q1 | 210 |
| median | 320 |
| Q3 | 500 |
| 95-th percentile | 880 |
| Maximum | 12000 |
| Range | 11999.999 |
| Interquartile range (IQR) | 290 |
Descriptive statistics
| Standard deviation | 257.818421 |
|---|---|
| Coefficient of variation (CV) | 0.6733971044 |
| Kurtosis | 47.41735371 |
| Mean | 382.8623844 |
| Median Absolute Deviation (MAD) | 130 |
| Skewness | 2.890217658 |
| Sum | 43701444 |
| Variance | 66470.33822 |
| Monotonicity | Not monotonic |
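Several of the figures above are derived from one another, so they can be cross-checked. A small sketch of those relationships, using only values printed in the tables above plus the observation count from the dataset statistics; the variable names are illustrative, and agreement is only up to the rounding of the printed inputs:

```python
import math

# Values copied from the report for this variable.
n_rows = 114144
minimum, maximum = 0.001, 12000
q1, q3 = 210, 500
mean = 382.8623844
std = 257.818421
total = 43701444

value_range = maximum - minimum   # Range
iqr = q3 - q1                     # Interquartile range
cv = std / mean                   # Coefficient of variation
variance = std ** 2               # Variance is the squared standard deviation
mean_from_sum = total / n_rows    # Mean recomputed from Sum

print(value_range, iqr)
print(round(cv, 6), round(variance, 2), round(mean_from_sum, 4))
```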
| Value | Count | Frequency (%) |
| 210 | 2941 | 2.6% |
| 400 | 2653 | 2.3% |
| 300 | 2628 | 2.3% |
| 500 | 2479 | 2.2% |
| 350 | 2341 | 2.1% |
| 250 | 2255 | 2.0% |
| 200 | 2166 | 1.9% |
| 600 | 2122 | 1.9% |
| 226 | 1987 | 1.7% |
| 450 | 1771 | 1.6% |
| Other values (1187) | 90801 | 79.5% |
| Value | Count | Frequency (%) |
| 0.001 | 1 | < 0.1% |
| 2 | 4 | < 0.1% |
| 3 | 2 | < 0.1% |
| 4 | 1 | < 0.1% |
| 5 | 5 | < 0.1% |
| Value | Count | Frequency (%) |
| 12000 | 1 | < 0.1% |
| 5981 | 1 | < 0.1% |
| 4305 | 1 | < 0.1% |
| 4215 | 1 | < 0.1% |
| 4107 | 1 | < 0.1% |
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 891.9 KiB |
| F | |
|---|---|
| M | |
| GD | 20 |
Length
| Max length | 2 |
|---|---|
| Median length | 1 |
| Mean length | 1.000175217 |
| Min length | 1 |
Characters and Unicode
| Total characters | 114164 |
|---|---|
| Distinct characters | 4 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | M |
|---|---|
| 2nd row | M |
| 3rd row | F |
| 4th row | F |
| 5th row | F |
| Value | Count | Frequency (%) |
| F | 71417 | 62.6% |
| M | 42707 | 37.4% |
| GD | 20 | < 0.1% |
| Value | Count | Frequency (%) |
| f | 71417 | 62.6% |
| m | 42707 | 37.4% |
| gd | 20 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| F | 71417 | |
| M | 42707 | |
| G | 20 | < 0.1% |
| D | 20 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 114164 |
Most frequent character per category
| Value | Count | Frequency (%) |
| F | 71417 | |
| M | 42707 | |
| G | 20 | < 0.1% |
| D | 20 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 114164 |
Most frequent character per script
| Value | Count | Frequency (%) |
| F | 71417 | |
| M | 42707 | |
| G | 20 | < 0.1% |
| D | 20 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 114164 |
Most frequent character per block
| Value | Count | Frequency (%) |
| F | 71417 | |
| M | 42707 | |
| G | 20 | < 0.1% |
| D | 20 | < 0.1% |
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 891.9 KiB |
| AV | |
|---|---|
| RP | |
| U | 4 |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 1.999964957 |
| Min length | 1 |
Characters and Unicode
| Total characters | 228284 |
|---|---|
| Distinct characters | 5 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | AV |
|---|---|
| 2nd row | AV |
| 3rd row | AV |
| 4th row | AV |
| 5th row | AV |
| Value | Count | Frequency (%) |
| AV | 98803 | 86.6% |
| RP | 15337 | 13.4% |
| U | 4 | < 0.1% |
| Value | Count | Frequency (%) |
| av | 98803 | 86.6% |
| rp | 15337 | 13.4% |
| u | 4 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 98803 | |
| V | 98803 | |
| R | 15337 | 6.7% |
| P | 15337 | 6.7% |
| U | 4 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 228284 |
Most frequent character per category
| Value | Count | Frequency (%) |
| A | 98803 | |
| V | 98803 | |
| R | 15337 | 6.7% |
| P | 15337 | 6.7% |
| U | 4 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 228284 |
Most frequent character per script
| Value | Count | Frequency (%) |
| A | 98803 | |
| V | 98803 | |
| R | 15337 | 6.7% |
| P | 15337 | 6.7% |
| U | 4 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 228284 |
Most frequent character per block
| Value | Count | Frequency (%) |
| A | 98803 | |
| V | 98803 | |
| R | 15337 | 6.7% |
| P | 15337 | 6.7% |
| U | 4 | < 0.1% |
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 891.9 KiB |
| M | |
|---|---|
| NE | |
| O | |
| PP | |
| U | 4075 |
Length
| Max length | 2 |
|---|---|
| Median length | 1 |
| Mean length | 1.403788197 |
| Min length | 1 |
Characters and Unicode
| Total characters | 160234 |
|---|---|
| Distinct characters | 6 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | M |
|---|---|
| 2nd row | U |
| 3rd row | M |
| 4th row | M |
| 5th row | M |
| Value | Count | Frequency (%) |
| M | 51873 | 45.4% |
| NE | 34025 | 29.8% |
| O | 12106 | 10.6% |
| PP | 12065 | 10.6% |
| U | 4075 | 3.6% |
| Value | Count | Frequency (%) |
| m | 51873 | 45.4% |
| ne | 34025 | 29.8% |
| o | 12106 | 10.6% |
| pp | 12065 | 10.6% |
| u | 4075 | 3.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| M | 51873 | |
| N | 34025 | |
| E | 34025 | |
| P | 24130 | |
| O | 12106 | 7.6% |
| U | 4075 | 2.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 160234 |
Most frequent character per category
| Value | Count | Frequency (%) |
| M | 51873 | |
| N | 34025 | |
| E | 34025 | |
| P | 24130 | |
| O | 12106 | 7.6% |
| U | 4075 | 2.5% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 160234 |
Most frequent character per script
| Value | Count | Frequency (%) |
| M | 51873 | |
| N | 34025 | |
| E | 34025 | |
| P | 24130 | |
| O | 12106 | 7.6% |
| U | 4075 | 2.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 160234 |
Most frequent character per block
| Value | Count | Frequency (%) |
| M | 51873 | |
| N | 34025 | |
| E | 34025 | |
| P | 24130 | |
| O | 12106 | 7.6% |
| U | 4075 | 2.5% |
| Distinct | 3125 |
|---|---|
| Distinct (%) | 2.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 382.8572231 |
|---|---|
| Minimum | 0.21 |
| Maximum | 12000 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 891.9 KiB |
Quantile statistics
| Minimum | 0.21 |
|---|---|
| 5-th percentile | 106 |
| Q1 | 210 |
| median | 320 |
| Q3 | 500 |
| 95-th percentile | 880 |
| Maximum | 12000 |
| Range | 11999.79 |
| Interquartile range (IQR) | 290 |
Descriptive statistics
| Standard deviation | 257.8189139 |
|---|---|
| Coefficient of variation (CV) | 0.6734074697 |
| Kurtosis | 47.41708631 |
| Mean | 382.8572231 |
| Median Absolute Deviation (MAD) | 130 |
| Skewness | 2.890193348 |
| Sum | 43700854.87 |
| Variance | 66470.59234 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 210 | 2938 | 2.6% |
| 400 | 2650 | 2.3% |
| 300 | 2624 | 2.3% |
| 500 | 2464 | 2.2% |
| 350 | 2339 | 2.0% |
| 250 | 2250 | 2.0% |
| 200 | 2160 | 1.9% |
| 600 | 2121 | 1.9% |
| 226 | 1971 | 1.7% |
| 450 | 1769 | 1.5% |
| Other values (3115) | 90858 | 79.6% |
| Value | Count | Frequency (%) |
| 0.21 | 1 | < 0.1% |
| 2 | 3 | < 0.1% |
| 2.1 | 1 | < 0.1% |
| 2.5 | 1 | < 0.1% |
| 3 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 12000 | 1 | < 0.1% |
| 5981 | 1 | < 0.1% |
| 4305 | 1 | < 0.1% |
| 4215 | 1 | < 0.1% |
| 4107.44 | 1 | < 0.1% |
| Distinct | 19353 |
|---|---|
| Distinct (%) | 17.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 891.9 KiB |
| GHI000112669 | |
|---|---|
| GHI000753413 | 1056 |
| GHI001206283 | 974 |
| GHI000143648 | 639 |
| GHI001853440 | 588 |
| Other values (19348) |
Length
| Max length | 12 |
|---|---|
| Median length | 12 |
| Mean length | 12 |
| Min length | 12 |
Characters and Unicode
| Total characters | 1369728 |
|---|---|
| Distinct characters | 13 |
| Distinct categories | 2 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 12685 |
|---|---|
| Unique (%) | 11.1% |
Sample
| 1st row | GHI001040140 |
|---|---|
| 2nd row | GHI000096195 |
| 3rd row | GHI000873216 |
| 4th row | GHI000165164 |
| 5th row | GHI000085542 |
| Value | Count | Frequency (%) |
| GHI000112669 | 18223 | 16.0% |
| GHI000753413 | 1056 | 0.9% |
| GHI001206283 | 974 | 0.9% |
| GHI000143648 | 639 | 0.6% |
| GHI001853440 | 588 | 0.5% |
| GHI001354498 | 521 | 0.5% |
| GHI000100619 | 521 | 0.5% |
| GHI001086470 | 517 | 0.5% |
| GHI000086558 | 500 | 0.4% |
| GHI000437510 | 494 | 0.4% |
| Other values (19343) | 90111 | 78.9% |
| Value | Count | Frequency (%) |
| ghi000112669 | 18223 | 16.0% |
| ghi000753413 | 1056 | 0.9% |
| ghi001206283 | 974 | 0.9% |
| ghi000143648 | 639 | 0.6% |
| ghi001853440 | 588 | 0.5% |
| ghi000100619 | 521 | 0.5% |
| ghi001354498 | 521 | 0.5% |
| ghi001086470 | 517 | 0.5% |
| ghi000086558 | 500 | 0.4% |
| ghi000437510 | 494 | 0.4% |
| Other values (19343) | 90111 | 78.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 378588 | |
| 1 | 133816 | 9.8% |
| G | 114144 | 8.3% |
| H | 114144 | 8.3% |
| I | 114144 | 8.3% |
| 6 | 89866 | 6.6% |
| 2 | 73495 | 5.4% |
| 9 | 69765 | 5.1% |
| 4 | 61201 | 4.5% |
| 8 | 56666 | 4.1% |
| Other values (3) | 163899 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1027296 | |
| Uppercase Letter | 342432 | 25.0% |
Most frequent character per category
| Value | Count | Frequency (%) |
| 0 | 378588 | |
| 1 | 133816 | 13.0% |
| 6 | 89866 | 8.7% |
| 2 | 73495 | 7.2% |
| 9 | 69765 | 6.8% |
| 4 | 61201 | 6.0% |
| 8 | 56666 | 5.5% |
| 3 | 55761 | 5.4% |
| 5 | 55459 | 5.4% |
| 7 | 52679 | 5.1% |
| Value | Count | Frequency (%) |
| G | 114144 | |
| H | 114144 | |
| I | 114144 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1027296 | |
| Latin | 342432 | 25.0% |
Most frequent character per script
| Value | Count | Frequency (%) |
| 0 | 378588 | |
| 1 | 133816 | 13.0% |
| 6 | 89866 | 8.7% |
| 2 | 73495 | 7.2% |
| 9 | 69765 | 6.8% |
| 4 | 61201 | 6.0% |
| 8 | 56666 | 5.5% |
| 3 | 55761 | 5.4% |
| 5 | 55459 | 5.4% |
| 7 | 52679 | 5.1% |
| Value | Count | Frequency (%) |
| G | 114144 | |
| H | 114144 | |
| I | 114144 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1369728 |
Most frequent character per block
| Value | Count | Frequency (%) |
| 0 | 378588 | |
| 1 | 133816 | 9.8% |
| G | 114144 | 8.3% |
| H | 114144 | 8.3% |
| I | 114144 | 8.3% |
| 6 | 89866 | 6.6% |
| 2 | 73495 | 5.4% |
| 9 | 69765 | 5.1% |
| 4 | 61201 | 4.5% |
| 8 | 56666 | 4.1% |
| Other values (3) | 163899 |
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 891.9 KiB |
| RA |
|---|
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Characters and Unicode
| Total characters | 228288 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | RA |
|---|---|
| 2nd row | RA |
| 3rd row | RA |
| 4th row | RA |
| 5th row | RA |
| Value | Count | Frequency (%) |
| RA | 114144 | 100.0% |
| Value | Count | Frequency (%) |
| ra | 114144 | 100.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| R | 114144 | |
| A | 114144 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 228288 |
Most frequent character per category
| Value | Count | Frequency (%) |
| R | 114144 | |
| A | 114144 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 228288 |
Most frequent character per script
| Value | Count | Frequency (%) |
| R | 114144 | |
| A | 114144 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 228288 |
Most frequent character per block
| Value | Count | Frequency (%) |
| R | 114144 | |
| A | 114144 |
| Distinct | 13 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 891.9 KiB |
| 25-29 | |
|---|---|
| 20-24 | |
| 30-34 | |
| 35-39 | |
| 40-44 | |
| Other values (8) |
Length
| Max length | 5 |
|---|---|
| Median length | 5 |
| Mean length | 4.885819666 |
| Min length | 2 |
Characters and Unicode
| Total characters | 557687 |
|---|---|
| Distinct characters | 12 |
| Distinct categories | 3 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 25-29 |
|---|---|
| 2nd row | 20-24 |
| 3rd row | 20-24 |
| 4th row | 40-44 |
| 5th row | 25-29 |
| Value | Count | Frequency (%) |
| 25-29 | 20220 | 17.7% |
| 20-24 | 17265 | 15.1% |
| 30-34 | 16476 | 14.4% |
| 35-39 | 13015 | 11.4% |
| 40-44 | 10260 | 9.0% |
| 45-49 | 9401 | 8.2% |
| 50-54 | 7379 | 6.5% |
| 65+ | 5915 | 5.2% |
| 55-59 | 5698 | 5.0% |
| 60-64 | 4184 | 3.7% |
| Other values (3) | 4331 | 3.8% |
| Value | Count | Frequency (%) |
| 25-29 | 20220 | 17.7% |
| 20-24 | 17265 | 15.1% |
| 30-34 | 16476 | 14.4% |
| 35-39 | 13015 | 11.4% |
| 40-44 | 10260 | 9.0% |
| 45-49 | 9401 | 8.2% |
| 50-54 | 7379 | 6.5% |
| 65 | 5915 | 5.2% |
| 55-59 | 5698 | 5.0% |
| 60-64 | 4184 | 3.7% |
| Other values (3) | 4331 | 3.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| - | 107828 | |
| 4 | 94886 | |
| 5 | 80403 | |
| 2 | 74970 | |
| 3 | 58982 | |
| 0 | 55564 | |
| 9 | 52264 | |
| 6 | 14357 | 2.6% |
| 1 | 8261 | 1.5% |
| + | 5915 | 1.1% |
| Other values (2) | 4257 | 0.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 443944 | |
| Dash Punctuation | 107828 | 19.3% |
| Math Symbol | 5915 | 1.1% |
Most frequent character per category
| Value | Count | Frequency (%) |
| 4 | 94886 | |
| 5 | 80403 | |
| 2 | 74970 | |
| 3 | 58982 | |
| 0 | 55564 | |
| 9 | 52264 | |
| 6 | 14357 | 3.2% |
| 1 | 8261 | 1.9% |
| 8 | 3930 | 0.9% |
| 7 | 327 | 0.1% |
| Value | Count | Frequency (%) |
| - | 107828 |
| Value | Count | Frequency (%) |
| + | 5915 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 557687 |
Most frequent character per script
| Value | Count | Frequency (%) |
| - | 107828 | |
| 4 | 94886 | |
| 5 | 80403 | |
| 2 | 74970 | |
| 3 | 58982 | |
| 0 | 55564 | |
| 9 | 52264 | |
| 6 | 14357 | 2.6% |
| 1 | 8261 | 1.5% |
| + | 5915 | 1.1% |
| Other values (2) | 4257 | 0.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 557687 |
Most frequent character per block
| Value | Count | Frequency (%) |
| - | 107828 | |
| 4 | 94886 | |
| 5 | 80403 | |
| 2 | 74970 | |
| 3 | 58982 | |
| 0 | 55564 | |
| 9 | 52264 | |
| 6 | 14357 | 2.6% |
| 1 | 8261 | 1.5% |
| + | 5915 | 1.1% |
| Other values (2) | 4257 | 0.8% |
| Distinct | 11 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 891.9 KiB |
| O | |
|---|---|
| AM | |
| C | |
| W | |
| BP | |
| Other values (6) |
Length
| Max length | 3 |
|---|---|
| Median length | 1 |
| Mean length | 1.459603659 |
| Min length | 1 |
Characters and Unicode
| Total characters | 166605 |
|---|---|
| Distinct characters | 14 |
| Distinct categories | 2 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | T |
|---|---|
| 2nd row | O |
| 3rd row | O |
| 4th row | W |
| 5th row | C |
| Value | Count | Frequency (%) |
| O | 31704 | 27.8% |
| AM | 29737 | 26.1% |
| C | 12745 | 11.2% |
| W | 7005 | 6.1% |
| BP | 6927 | 6.1% |
| T | 5909 | 5.2% |
| S | 5601 | 4.9% |
| NL | 4160 | 3.6% |
| EC | 3899 | 3.4% |
| Wlg | 3869 | 3.4% |
| Value | Count | Frequency (%) |
| o | 31704 | 27.8% |
| am | 29737 | 26.1% |
| c | 12745 | 11.2% |
| w | 7005 | 6.1% |
| bp | 6927 | 6.1% |
| t | 5909 | 5.2% |
| s | 5601 | 4.9% |
| nl | 4160 | 3.6% |
| ec | 3899 | 3.4% |
| wlg | 3869 | 3.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| O | 31704 | |
| A | 29737 | |
| M | 29737 | |
| C | 16644 | |
| W | 10874 | 6.5% |
| B | 6927 | 4.2% |
| P | 6927 | 4.2% |
| N | 6748 | 4.1% |
| T | 5909 | 3.5% |
| S | 5601 | 3.4% |
| Other values (4) | 15797 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 158867 | |
| Lowercase Letter | 7738 | 4.6% |
Most frequent character per category
| Value | Count | Frequency (%) |
| O | 31704 | |
| A | 29737 | |
| M | 29737 | |
| C | 16644 | |
| W | 10874 | 6.8% |
| B | 6927 | 4.4% |
| P | 6927 | 4.4% |
| N | 6748 | 4.2% |
| T | 5909 | 3.7% |
| S | 5601 | 3.5% |
| Other values (2) | 8059 | 5.1% |
| Value | Count | Frequency (%) |
| l | 3869 | |
| g | 3869 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 166605 |
Most frequent character per script
| Value | Count | Frequency (%) |
| O | 31704 | |
| A | 29737 | |
| M | 29737 | |
| C | 16644 | |
| W | 10874 | 6.5% |
| B | 6927 | 4.2% |
| P | 6927 | 4.2% |
| N | 6748 | 4.1% |
| T | 5909 | 3.5% |
| S | 5601 | 3.4% |
| Other values (4) | 15797 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 166605 |
Most frequent character per block
| Value | Count | Frequency (%) |
| O | 31704 | |
| A | 29737 | |
| M | 29737 | |
| C | 16644 | |
| W | 10874 | 6.5% |
| B | 6927 | 4.2% |
| P | 6927 | 4.2% |
| N | 6748 | 4.1% |
| T | 5909 | 3.5% |
| S | 5601 | 3.4% |
| Other values (4) | 15797 |
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 891.9 KiB |
| 0 | |
|---|---|
| 1 | 437 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 114144 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
| Value | Count | Frequency (%) |
| 0 | 113707 | 99.6% |
| 1 | 437 | 0.4% |
| Value | Count | Frequency (%) |
| 0 | 113707 | 99.6% |
| 1 | 437 | 0.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 113707 | |
| 1 | 437 | 0.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 114144 |
Most frequent character per category
| Value | Count | Frequency (%) |
| 0 | 113707 | |
| 1 | 437 | 0.4% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 114144 |
Most frequent character per script
| Value | Count | Frequency (%) |
| 0 | 113707 | |
| 1 | 437 | 0.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 114144 |
Most frequent character per block
| Value | Count | Frequency (%) |
| 0 | 113707 | |
| 1 | 437 | 0.4% |
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 891.9 KiB |
| MidAge | |
|---|---|
| Adult | |
| Old | |
| Teenage | 401 |
Length
| Max length | 7 |
|---|---|
| Median length | 5 |
| Mean length | 5.03155663 |
| Min length | 3 |
Characters and Unicode
| Total characters | 574322 |
|---|---|
| Distinct characters | 13 |
| Distinct categories | 2 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Adult |
|---|---|
| 2nd row | Adult |
| 3rd row | Adult |
| 4th row | MidAge |
| 5th row | Adult |
| Value | Count | Frequency (%) |
| MidAge | 49152 | 43.1% |
| Adult | 41415 | 36.3% |
| Old | 23176 | 20.3% |
| Teenage | 401 | 0.4% |
| Value | Count | Frequency (%) |
| midage | 49152 | 43.1% |
| adult | 41415 | 36.3% |
| old | 23176 | 20.3% |
| teenage | 401 | 0.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| d | 113743 | |
| A | 90567 | |
| l | 64591 | |
| e | 50355 | |
| g | 49553 | |
| M | 49152 | |
| i | 49152 | |
| u | 41415 | 7.2% |
| t | 41415 | 7.2% |
| O | 23176 | 4.0% |
| Other values (3) | 1203 | 0.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 411026 | |
| Uppercase Letter | 163296 | 28.4% |
Most frequent character per category
| Value | Count | Frequency (%) |
| d | 113743 | |
| l | 64591 | |
| e | 50355 | |
| g | 49553 | |
| i | 49152 | |
| u | 41415 | 10.1% |
| t | 41415 | 10.1% |
| n | 401 | 0.1% |
| a | 401 | 0.1% |
| Value | Count | Frequency (%) |
| A | 90567 | |
| M | 49152 | |
| O | 23176 | 14.2% |
| T | 401 | 0.2% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 574322 |
Most frequent character per script
| Value | Count | Frequency (%) |
| d | 113743 | |
| A | 90567 | |
| l | 64591 | |
| e | 50355 | |
| g | 49553 | |
| M | 49152 | |
| i | 49152 | |
| u | 41415 | 7.2% |
| t | 41415 | 7.2% |
| O | 23176 | 4.0% |
| Other values (3) | 1203 | 0.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 574322 |
Most frequent character per block
| Value | Count | Frequency (%) |
| d | 113743 | |
| A | 90567 | |
| l | 64591 | |
| e | 50355 | |
| g | 49553 | |
| M | 49152 | |
| i | 49152 | |
| u | 41415 | 7.2% |
| t | 41415 | 7.2% |
| O | 23176 | 4.0% |
| Other values (3) | 1203 | 0.2% |
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 891.9 KiB |
| AV | |
|---|---|
| RPU |
Length
| Max length | 3 |
|---|---|
| Median length | 2 |
| Mean length | 2.134400407 |
| Min length | 2 |
Characters and Unicode
| Total characters | 243629 |
|---|---|
| Distinct characters | 5 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | AV |
|---|---|
| 2nd row | AV |
| 3rd row | AV |
| 4th row | AV |
| 5th row | AV |
| Value | Count | Frequency (%) |
| AV | 98803 | 86.6% |
| RPU | 15341 | 13.4% |
| Value | Count | Frequency (%) |
| av | 98803 | 86.6% |
| rpu | 15341 | 13.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 98803 | |
| V | 98803 | |
| R | 15341 | 6.3% |
| P | 15341 | 6.3% |
| U | 15341 | 6.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 243629 |
Most frequent character per category
| Value | Count | Frequency (%) |
| A | 98803 | |
| V | 98803 | |
| R | 15341 | 6.3% |
| P | 15341 | 6.3% |
| U | 15341 | 6.3% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 243629 |
Most frequent character per script
| Value | Count | Frequency (%) |
| A | 98803 | |
| V | 98803 | |
| R | 15341 | 6.3% |
| P | 15341 | 6.3% |
| U | 15341 | 6.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 243629 |
Most frequent character per block
| Value | Count | Frequency (%) |
| A | 98803 | |
| V | 98803 | |
| R | 15341 | 6.3% |
| P | 15341 | 6.3% |
| U | 15341 | 6.3% |
| Distinct | 1197 |
|---|---|
| Distinct (%) | 1.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5.750004317 |
|---|---|
| Minimum | -6.907755279 |
| Maximum | 9.392661929 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 891.9 KiB |
Quantile statistics
| Minimum | -6.907755279 |
|---|---|
| 5-th percentile | 4.663439094 |
| Q1 | 5.347107531 |
| median | 5.768320996 |
| Q3 | 6.214608098 |
| 95-th percentile | 6.779921907 |
| Maximum | 9.392661929 |
| Range | 16.30041721 |
| Interquartile range (IQR) | 0.8675005677 |
Descriptive statistics
| Standard deviation | 0.6409119857 |
|---|---|
| Coefficient of variation (CV) | 0.1114628704 |
| Kurtosis | 1.601402047 |
| Mean | 5.750004317 |
| Median Absolute Deviation (MAD) | 0.4307829161 |
| Skewness | -0.2494680754 |
| Sum | 656328.4928 |
| Variance | 0.4107681735 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 5.347107531 | 2941 | 2.6% |
| 5.991464547 | 2653 | 2.3% |
| 5.703782475 | 2628 | 2.3% |
| 6.214608098 | 2479 | 2.2% |
| 5.857933154 | 2341 | 2.1% |
| 5.521460918 | 2255 | 2.0% |
| 5.298317367 | 2166 | 1.9% |
| 6.396929655 | 2122 | 1.9% |
| 5.420534999 | 1987 | 1.7% |
| 6.109247583 | 1771 | 1.6% |
| Other values (1187) | 90801 | 79.5% |
| Value | Count | Frequency (%) |
| -6.907755279 | 1 | < 0.1% |
| 0.6931471806 | 4 | < 0.1% |
| 1.098612289 | 2 | < 0.1% |
| 1.386294361 | 1 | < 0.1% |
| 1.609437912 | 5 | < 0.1% |
| Value | Count | Frequency (%) |
| 9.392661929 | 1 | < 0.1% |
| 8.696343057 | 1 | < 0.1% |
| 8.367532417 | 1 | < 0.1% |
| 8.34640487 | 1 | < 0.1% |
| 8.320448114 | 1 | < 0.1% |
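The values in this block line up with the natural logarithm of the raw variable profiled earlier (minimum 0.001, maximum 12000, most common value 210, same distinct count of 1197). The report itself does not state how the column was derived, so the following check is a sketch under that assumption:

```python
import math

# Raw values and the corresponding values printed in this block of the report.
pairs = {
    0.001: -6.907755279,   # minimum
    210: 5.347107531,      # most common value
    12000: 9.392661929,    # maximum
}

for raw, reported in pairs.items():
    # math.log is the natural logarithm (base e).
    print(raw, round(math.log(raw), 9), reported)
```

The single raw value of 0.001 becomes the isolated minimum of about -6.91, far below the 5th percentile of 4.66, so the log transform does not remove that outlier; it only moves it to the other tail.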
| Distinct | 3125 |
|---|---|
| Distinct (%) | 2.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5.750026064 |
|---|---|
| Minimum | -1.560647748 |
| Maximum | 9.392661929 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 891.9 KiB |
Quantile statistics
| Minimum | -1.560647748 |
|---|---|
| 5-th percentile | 4.663439094 |
| Q1 | 5.347107531 |
| median | 5.768320996 |
| Q3 | 6.214608098 |
| 95-th percentile | 6.779921907 |
| Maximum | 9.392661929 |
| Range | 10.95330968 |
| Interquartile range (IQR) | 0.8675005677 |
Descriptive statistics
| Standard deviation | 0.6402153729 |
|---|---|
| Coefficient of variation (CV) | 0.1113412993 |
| Kurtosis | 0.4355168226 |
| Mean | 5.750026064 |
| Median Absolute Deviation (MAD) | 0.4321881782 |
| Skewness | -0.1963128263 |
| Sum | 656330.975 |
| Variance | 0.4098757237 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 5.347107531 | 2938 | 2.6% |
| 5.991464547 | 2650 | 2.3% |
| 5.703782475 | 2624 | 2.3% |
| 6.214608098 | 2464 | 2.2% |
| 5.857933154 | 2339 | 2.0% |
| 5.521460918 | 2250 | 2.0% |
| 5.298317367 | 2160 | 1.9% |
| 6.396929655 | 2121 | 1.9% |
| 5.420534999 | 1971 | 1.7% |
| 6.109247583 | 1769 | 1.5% |
| Other values (3115) | 90858 | 79.6% |
| Value | Count | Frequency (%) |
| -1.560647748 | 1 | < 0.1% |
| 0.6931471806 | 3 | < 0.1% |
| 0.7419373447 | 1 | < 0.1% |
| 0.9162907319 | 1 | < 0.1% |
| 1.098612289 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 9.392661929 | 1 | < 0.1% |
| 8.696343057 | 1 | < 0.1% |
| 8.367532417 | 1 | < 0.1% |
| 8.34640487 | 1 | < 0.1% |
| 8.320555242 | 1 | < 0.1% |
| Distinct | 1993 |
|---|---|
| Distinct (%) | 1.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.001806261 |
|---|---|
| Minimum | 0.8333333333 |
| Maximum | 210 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 891.9 KiB |
Quantile statistics
| Minimum | 0.8333333333 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 1 |
| Q3 | 1 |
| 95-th percentile | 1 |
| Maximum | 210 |
| Range | 209.1666667 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.6186145925 |
|---|---|
| Coefficient of variation (CV) | 0.617499228 |
| Kurtosis | 114143.6145 |
| Mean | 1.001806261 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 337.8511828 |
| Sum | 114350.1738 |
| Variance | 0.3826840141 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 109859 | 96.2% |
| 0.9974747475 | 55 | < 0.1% |
| 0.997983871 | 49 | < 0.1% |
| 0.9978991597 | 34 | < 0.1% |
| 0.9993359894 | 34 | < 0.1% |
| 0.9973404255 | 34 | < 0.1% |
| 0.9969325153 | 33 | < 0.1% |
| 0.9966216216 | 31 | < 0.1% |
| 0.9979423868 | 31 | < 0.1% |
| 0.999070632 | 30 | < 0.1% |
| Other values (1983) | 3954 | 3.5% |
| Value | Count | Frequency (%) |
| 0.8333333333 | 1 | < 0.1% |
| 0.9375 | 1 | < 0.1% |
| 0.94 | 1 | < 0.1% |
| 0.9444444444 | 1 | < 0.1% |
| 0.9642857143 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 210 | 1 | < 0.1% |
| 1.075 | 1 | < 0.1% |
| 1.06 | 1 | < 0.1% |
| 1.05 | 1 | < 0.1% |
| 1.032 | 1 | < 0.1% |
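The extreme skewness reported for this variable (337.85) is what a single outlier produces: nearly every value is 1 and one value is 210. The sketch below uses an idealized column, not the real data, and assumes the report's skewness is the bias-adjusted sample skewness (the estimator pandas uses in `Series.skew`, as far as I know); `adjusted_skewness` is an illustrative helper. For such a column the statistic equals the square root of the row count, whatever the size of the outlier:

```python
import math

def adjusted_skewness(values):
    """Bias-adjusted sample skewness (adjusted Fisher-Pearson coefficient)."""
    n = len(values)
    mean = math.fsum(values) / n
    m2 = math.fsum((v - mean) ** 2 for v in values) / n  # biased 2nd moment
    m3 = math.fsum((v - mean) ** 3 for v in values) / n  # biased 3rd moment
    g1 = m3 / m2 ** 1.5
    return g1 * math.sqrt(n * (n - 1)) / (n - 2)

# Idealized stand-in for the column: all ones plus one outlier of 210.
n = 114144
idealized = [1.0] * (n - 1) + [210.0]

skew = adjusted_skewness(idealized)
print(round(skew, 3), round(math.sqrt(n), 3))
```

Because the statistic here reflects the row count rather than the shape of the bulk of the data, inspecting or removing the single row with value 210 is likely more informative than transforming the whole column.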
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 891.9 KiB |
| 2019 | |
|---|---|
| 2020 | |
| 2018 | |
| 2017 | |
| 2016 | 2051 |
Length
| Max length | 4 |
|---|---|
| Median length | 4 |
| Mean length | 4 |
| Min length | 4 |
Characters and Unicode
| Total characters | 456576 |
|---|---|
| Distinct characters | 7 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2020 |
|---|---|
| 2nd row | 2020 |
| 3rd row | 2020 |
| 4th row | 2019 |
| 5th row | 2017 |
| Value | Count | Frequency (%) |
| 2019 | 30321 | 26.6% |
| 2020 | 29656 | 26.0% |
| 2018 | 26638 | 23.3% |
| 2017 | 25478 | 22.3% |
| 2016 | 2051 | 1.8% |
| Value | Count | Frequency (%) |
| 2019 | 30321 | 26.6% |
| 2020 | 29656 | 26.0% |
| 2018 | 26638 | 23.3% |
| 2017 | 25478 | 22.3% |
| 2016 | 2051 | 1.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 143800 | |
| 0 | 143800 | |
| 1 | 84488 | |
| 9 | 30321 | 6.6% |
| 8 | 26638 | 5.8% |
| 7 | 25478 | 5.6% |
| 6 | 2051 | 0.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 456576 |
Most frequent character per category
| Value | Count | Frequency (%) |
| 2 | 143800 | |
| 0 | 143800 | |
| 1 | 84488 | |
| 9 | 30321 | 6.6% |
| 8 | 26638 | 5.8% |
| 7 | 25478 | 5.6% |
| 6 | 2051 | 0.4% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 456576 |
Most frequent character per script
| Value | Count | Frequency (%) |
| 2 | 143800 | |
| 0 | 143800 | |
| 1 | 84488 | |
| 9 | 30321 | 6.6% |
| 8 | 26638 | 5.8% |
| 7 | 25478 | 5.6% |
| 6 | 2051 | 0.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
|---|---|---|
| ASCII | 456576 | 100.0% |
Month
Real number (ℝ≥0)
| Distinct | 12 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 6.724724909 |
|---|---|
| Minimum | 1 |
| Maximum | 12 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 891.9 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 4 |
| median | 7 |
| Q3 | 10 |
| 95-th percentile | 12 |
| Maximum | 12 |
| Range | 11 |
| Interquartile range (IQR) | 6 |
Descriptive statistics
| Standard deviation | 3.373737503 |
|---|---|
| Coefficient of variation (CV) | 0.5016915263 |
| Kurtosis | -1.147591316 |
| Mean | 6.724724909 |
| Median Absolute Deviation (MAD) | 3 |
| Skewness | -0.1156909286 |
| Sum | 767587 |
| Variance | 11.38210474 |
| Monotonicity | Not monotonic |
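Several entries in the descriptive statistics above can be recomputed from the other reported figures. The sketch below does this for the variable summarized in these tables; the variable names are illustrative, and the observation count (114144) is taken from the dataset overview rather than from this block.

```python
# Figures copied from the tables above (rounded as reported).
n_observations = 114144   # number of observations, from the dataset overview
total = 767587            # Sum
std = 3.373737503         # Standard deviation
minimum, maximum = 1, 12  # Minimum, Maximum
q1, q3 = 4, 10            # Q1, Q3

mean = total / n_observations    # Mean = Sum / number of observations
cv = std / mean                  # Coefficient of variation = std / mean
variance = std ** 2              # Variance = std squared
value_range = maximum - minimum  # Range = Maximum - Minimum
iqr = q3 - q1                    # Interquartile range = Q3 - Q1

print(f"mean={mean:.6f} cv={cv:.6f} variance={variance:.5f} "
      f"range={value_range} iqr={iqr}")
```

The recomputed values should agree with the reported ones up to rounding of the inputs.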
| Value | Count | Frequency (%) |
|---|---|---|
| 8 | 10925 | 9.6% |
| 7 | 10875 | 9.5% |
| 11 | 10558 | 9.2% |
| 10 | 10161 | 8.9% |
| 9 | 10033 | 8.8% |
| 6 | 9964 | 8.7% |
| 5 | 9911 | 8.7% |
| 3 | 9350 | 8.2% |
| 12 | 8929 | 7.8% |
| 2 | 8710 | 7.6% |
| Other values (2) | 14728 | 12.9% |

| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 8284 | 7.3% |
| 2 | 8710 | 7.6% |
| 3 | 9350 | 8.2% |
| 4 | 6444 | 5.6% |
| 5 | 9911 | 8.7% |

| Value | Count | Frequency (%) |
|---|---|---|
| 12 | 8929 | 7.8% |
| 11 | 10558 | 9.2% |
| 10 | 10161 | 8.9% |
| 9 | 10033 | 8.8% |
| 8 | 10925 | 9.6% |
Week
Real number (ℝ≥0)
| Distinct | 52 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 27.58487525 |
|---|---|
| Minimum | 1 |
| Maximum | 52 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 891.9 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 4 |
| Q1 | 15 |
| median | 28 |
| Q3 | 40 |
| 95-th percentile | 50 |
| Maximum | 52 |
| Range | 51 |
| Interquartile range (IQR) | 25 |
Descriptive statistics
| Standard deviation | 14.67184292 |
|---|---|
| Coefficient of variation (CV) | 0.5318799811 |
| Kurtosis | -1.164333717 |
| Mean | 27.58487525 |
| Median Absolute Deviation (MAD) | 13 |
| Skewness | -0.1112622406 |
| Sum | 3148648 |
| Variance | 215.2629748 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
|---|---|---|
| 48 | 2760 | 2.4% |
| 49 | 2572 | 2.3% |
| 51 | 2537 | 2.2% |
| 31 | 2530 | 2.2% |
| 26 | 2507 | 2.2% |
| 33 | 2506 | 2.2% |
| 27 | 2505 | 2.2% |
| 25 | 2478 | 2.2% |
| 35 | 2463 | 2.2% |
| 38 | 2448 | 2.1% |
| Other values (42) | 88838 | 77.8% |

| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 855 | 0.7% |
| 2 | 1838 | 1.6% |
| 3 | 2278 | 2.0% |
| 4 | 2216 | 1.9% |
| 5 | 2051 | 1.8% |

| Value | Count | Frequency (%) |
|---|---|---|
| 52 | 1128 | 1.0% |
| 51 | 2537 | 2.2% |
| 50 | 2325 | 2.0% |
| 49 | 2572 | 2.3% |
| 48 | 2760 | 2.4% |
Day
Real number (ℝ≥0)
| Distinct | 31 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 15.8257289 |
|---|---|
| Minimum | 1 |
| Maximum | 31 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 891.9 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 8 |
| median | 16 |
| Q3 | 23 |
| 95-th percentile | 29 |
| Maximum | 31 |
| Range | 30 |
| Interquartile range (IQR) | 15 |
Descriptive statistics
| Standard deviation | 8.664953992 |
|---|---|
| Coefficient of variation (CV) | 0.5475232164 |
| Kurtosis | -1.155008974 |
| Mean | 15.8257289 |
| Median Absolute Deviation (MAD) | 7 |
| Skewness | 0.005606195007 |
| Sum | 1806412 |
| Variance | 75.08142768 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
|---|---|---|
| 20 | 4261 | 3.7% |
| 19 | 4181 | 3.7% |
| 13 | 4079 | 3.6% |
| 21 | 3968 | 3.5% |
| 17 | 3955 | 3.5% |
| 11 | 3940 | 3.5% |
| 27 | 3916 | 3.4% |
| 12 | 3912 | 3.4% |
| 24 | 3908 | 3.4% |
| 5 | 3886 | 3.4% |
| Other values (21) | 74138 | 65.0% |

| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 3278 | 2.9% |
| 2 | 3438 | 3.0% |
| 3 | 3480 | 3.0% |
| 4 | 3594 | 3.1% |
| 5 | 3886 | 3.4% |

| Value | Count | Frequency (%) |
|---|---|---|
| 31 | 2314 | 2.0% |
| 30 | 3353 | 2.9% |
| 29 | 3449 | 3.0% |
| 28 | 3557 | 3.1% |
| 27 | 3916 | 3.4% |
Dayofweek
Real number (ℝ≥0)
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.089457177 |
|---|---|
| Minimum | 0 |
| Maximum | 6 |
| Zeros | 21644 |
| Zeros (%) | 19.0% |
| Memory size | 891.9 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1 |
| median | 2 |
| Q3 | 3 |
| 95-th percentile | 4 |
| Maximum | 6 |
| Range | 6 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 1.43620049 |
|---|---|
| Coefficient of variation (CV) | 0.6873557908 |
| Kurtosis | -1.279331378 |
| Mean | 2.089457177 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | -0.0553214757 |
| Sum | 238499 |
| Variance | 2.062671848 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
|---|---|---|
| 4 | 24789 | 21.7% |
| 3 | 23421 | 20.5% |
| 2 | 22230 | 19.5% |
| 0 | 21644 | 19.0% |
| 1 | 21421 | 18.8% |
| 5 | 635 | 0.6% |
| 6 | 4 | < 0.1% |

| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 21644 | 19.0% |
| 1 | 21421 | 18.8% |
| 2 | 22230 | 19.5% |
| 3 | 23421 | 20.5% |
| 4 | 24789 | 21.7% |

| Value | Count | Frequency (%) |
|---|---|---|
| 6 | 4 | < 0.1% |
| 5 | 635 | 0.6% |
| 4 | 24789 | 21.7% |
| 3 | 23421 | 20.5% |
| 2 | 22230 | 19.5% |
Dayofyear
Real number (ℝ≥0)
| Distinct | 359 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 189.3833841 |
|---|---|
| Minimum | 3 |
| Maximum | 365 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 891.9 KiB |
Quantile statistics
| Minimum | 3 |
|---|---|
| 5-th percentile | 23 |
| Q1 | 100 |
| median | 194 |
| Q3 | 276 |
| 95-th percentile | 345 |
| Maximum | 365 |
| Range | 362 |
| Interquartile range (IQR) | 176 |
Descriptive statistics
| Standard deviation | 102.7471838 |
|---|---|
| Coefficient of variation (CV) | 0.5425353669 |
| Kurtosis | -1.158843022 |
| Mean | 189.3833841 |
| Median Absolute Deviation (MAD) | 87 |
| Skewness | -0.1055779516 |
| Sum | 21616977 |
| Variance | 10556.98378 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
|---|---|---|
| 213 | 557 | 0.5% |
| 227 | 524 | 0.5% |
| 354 | 520 | 0.5% |
| 311 | 514 | 0.5% |
| 226 | 513 | 0.4% |
| 332 | 507 | 0.4% |
| 318 | 501 | 0.4% |
| 331 | 500 | 0.4% |
| 304 | 499 | 0.4% |
| 276 | 498 | 0.4% |
| Other values (349) | 109011 | 95.5% |

| Value | Count | Frequency (%) |
|---|---|---|
| 3 | 192 | 0.2% |
| 4 | 215 | 0.2% |
| 5 | 145 | 0.1% |
| 6 | 172 | 0.2% |
| 7 | 191 | 0.2% |

| Value | Count | Frequency (%) |
|---|---|---|
| 365 | 188 | 0.2% |
| 364 | 147 | 0.1% |
| 363 | 141 | 0.1% |
| 362 | 155 | 0.1% |
| 361 | 292 | 0.3% |
Is_month_end
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 111.6 KiB |
| Value | Count | Frequency (%) |
|---|---|---|
| False | 110280 | 96.6% |
| True | 3864 | 3.4% |
Is_month_start
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 111.6 KiB |
| Value | Count | Frequency (%) |
|---|---|---|
| False | 110866 | 97.1% |
| True | 3278 | 2.9% |
Is_quarter_end
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 111.6 KiB |
| Value | Count | Frequency (%) |
|---|---|---|
| False | 113334 | 99.3% |
| True | 810 | 0.7% |
Is_quarter_start
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 111.6 KiB |
| Value | Count | Frequency (%) |
|---|---|---|
| False | 113377 | 99.3% |
| True | 767 | 0.7% |
Is_year_end
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 111.6 KiB |
| Value | Count | Frequency (%) |
|---|---|---|
| False | 114015 | 99.9% |
| True | 129 | 0.1% |
Is_year_start
Boolean
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 111.6 KiB |
| Value | Count | Frequency (%) |
|---|---|---|
| False | 114144 | 100.0% |
Elapsed
Real number (ℝ≥0)
| Distinct | 1123 |
|---|---|
| Distinct (%) | 1.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1547631170 |
|---|---|
| Minimum | 1480550400 |
| Maximum | 1606694400 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 891.9 KiB |
Quantile statistics
| Minimum | 1480550400 |
|---|---|
| 5-th percentile | 1488153600 |
| Q1 | 1516233600 |
| median | 1550016000 |
| Q3 | 1579132800 |
| 95-th percentile | 1602028800 |
| Maximum | 1606694400 |
| Range | 126144000 |
| Interquartile range (IQR) | 62899200 |
Descriptive statistics
| Standard deviation | 36734422.09 |
|---|---|
| Coefficient of variation (CV) | 0.02373590219 |
| Kurtosis | -1.19025494 |
| Mean | 1547631170 |
| Median Absolute Deviation (MAD) | 31276800 |
| Skewness | -0.1176796452 |
| Sum | 1.766528123 × 10¹⁴ |
| Variance | 1.349417766 × 10¹⁵ |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
|---|---|---|
| 1577059200 | 204 | 0.2% |
| 1576800000 | 184 | 0.2% |
| 1596153600 | 183 | 0.2% |
| 1595203200 | 181 | 0.2% |
| 1606694400 | 180 | 0.2% |
| 1603756800 | 180 | 0.2% |
| 1600646400 | 179 | 0.2% |
| 1564358400 | 178 | 0.2% |
| 1572825600 | 174 | 0.2% |
| 1576713600 | 172 | 0.2% |
| Other values (1113) | 112329 | 98.4% |

| Value | Count | Frequency (%) |
|---|---|---|
| 1480550400 | 117 | 0.1% |
| 1480636800 | 93 | 0.1% |
| 1480896000 | 81 | 0.1% |
| 1480982400 | 88 | 0.1% |
| 1481068800 | 148 | 0.1% |

| Value | Count | Frequency (%) |
|---|---|---|
| 1606694400 | 180 | 0.2% |
| 1606521600 | 12 | < 0.1% |
| 1606435200 | 172 | 0.2% |
| 1606348800 | 162 | 0.1% |
| 1606262400 | 149 | 0.1% |
Pearson's r
Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. Its value lies between -1 and +1, with -1 indicating total negative linear correlation, 0 indicating no linear correlation, and +1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, which implies that for a linear relationship the steepness of the line (its angle to the x-axis) does not affect r. To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
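The calculation described above can be sketched in a few lines of plain Python. This is an illustrative implementation, not the code the report itself runs; the function name `pearson_r` and the sample data are assumptions made for the example.

```python
import math

def pearson_r(x, y):
    """Covariance of x and y divided by the product of their standard deviations."""
    n = len(x)
    if n != len(y) or n < 2:
        raise ValueError("x and y must have the same length (at least 2)")
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    # The 1/n factors of the covariance and of both variances cancel,
    # so plain sums of products and squares are enough.
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    if var_x == 0 or var_y == 0:
        raise ValueError("r is undefined when a variable is constant")
    return cov / math.sqrt(var_x * var_y)

x = [1, 2, 3, 4, 5]                 # hypothetical sample
y = [2 * v + 3 for v in x]          # exact linear function of x
y_rescaled = [10 * v - 7 for v in y]  # shifted and rescaled copy of y

print(pearson_r(x, y), pearson_r(x, y_rescaled))
```

Both calls give the same r, which illustrates the invariance under changes of location and (positive) scale.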
Spearman's ρ
Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better at catching nonlinear monotonic correlations than Pearson's r. Its value lies between -1 and +1, with -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation, and +1 indicating total positive monotonic correlation. To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
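As a minimal sketch of that definition, ρ can be computed by ranking both variables and applying the Pearson formula to the ranks. The helper names and the tie handling (tied values share the average of their positions) are assumptions for the example, not taken from the report.

```python
import math

def average_ranks(values):
    """Return 1-based ranks; tied values share the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values equal to values[order[i]].
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        shared = (i + j) / 2 + 1  # mean of positions i..j, made 1-based
        for k in range(i, j + 1):
            ranks[order[k]] = shared
        i = j + 1
    return ranks

def pearson_r(x, y):
    """Covariance divided by the product of the standard deviations."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / math.sqrt(var_x * var_y)

def spearman_rho(x, y):
    """Pearson's r applied to the rank variables of x and y."""
    return pearson_r(average_ranks(x), average_ranks(y))

x = [1, 2, 3, 4, 5]          # hypothetical sample
y = [v ** 3 for v in x]      # nonlinear but strictly increasing in x

rho = spearman_rho(x, y)
r = pearson_r(x, y)
print(rho, r)
```

On this sample ρ reaches 1 because the relationship is perfectly monotonic, while Pearson's r stays below 1 because it is not linear.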
Kendall's τ
Like Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. Its value lies between -1 and +1, with -1 indicating total negative correlation, 0 indicating no correlation, and +1 indicating total positive correlation. To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the number of discordant pairs, divided by the total number of pairs.
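The pair-counting rule above corresponds to the variant usually called tau-a. The sketch below implements exactly that rule; the function name `kendall_tau_a` and the sample data are illustrative, and statistical libraries often apply a tie correction (tau-b) that this sketch does not include.

```python
from itertools import combinations

def kendall_tau_a(x, y):
    """(concordant pairs - discordant pairs) / total number of pairs."""
    n = len(x)
    if n != len(y) or n < 2:
        raise ValueError("x and y must have the same length (at least 2)")
    concordant = 0
    discordant = 0
    for i, j in combinations(range(n), 2):
        # The sign of the product tells whether x and y move the same way.
        direction = (x[i] - x[j]) * (y[i] - y[j])
        if direction > 0:
            concordant += 1
        elif direction < 0:
            discordant += 1
        # direction == 0 means a tie in x or y: counted in neither group.
    total_pairs = n * (n - 1) / 2
    return (concordant - discordant) / total_pairs

x = [1, 2, 3, 4]   # hypothetical sample
y = [1, 3, 2, 4]   # one swapped pair relative to x

tau = kendall_tau_a(x, y)
print(tau)
```

With four observations there are six pairs; five are concordant and one is discordant, so τ = (5 - 1) / 6.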
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. Extensive documentation is available.
Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure proposed by Bergsma in 2013.
First rows
| | Applied | Gender | Payment_Method | Location | Received | Id | Reason | Age | Area | True_False | AgeGroup | Payment_Type | logapplied | logreceived | Ratio | Year | Month | Week | Day | Dayofweek | Dayofyear | Is_month_end | Is_month_start | Is_quarter_end | Is_quarter_start | Is_year_end | Is_year_start | Elapsed |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 95.0 | M | AV | M | 95.0 | GHI001040140 | RA | 25-29 | T | 0 | Adult | AV | 4.553877 | 4.553877 | 1.0 | 2020 | 10 | 42 | 16 | 4 | 290 | False | False | False | False | False | False | 1602806400 |
| 1 | 90.0 | M | AV | U | 90.0 | GHI000096195 | RA | 20-24 | O | 0 | Adult | AV | 4.499810 | 4.499810 | 1.0 | 2020 | 5 | 20 | 13 | 2 | 134 | False | False | False | False | False | False | 1589328000 |
| 2 | 85.0 | F | AV | M | 85.0 | GHI000873216 | RA | 20-24 | O | 0 | Adult | AV | 4.442651 | 4.442651 | 1.0 | 2020 | 9 | 40 | 29 | 1 | 273 | False | False | False | False | False | False | 1601337600 |
| 3 | 425.0 | F | AV | M | 425.0 | GHI000165164 | RA | 40-44 | W | 0 | MidAge | AV | 6.052089 | 6.052089 | 1.0 | 2019 | 7 | 28 | 10 | 2 | 191 | False | False | False | False | False | False | 1562716800 |
| 4 | 450.0 | F | AV | M | 450.0 | GHI000085542 | RA | 25-29 | C | 0 | Adult | AV | 6.109248 | 6.109248 | 1.0 | 2017 | 9 | 38 | 18 | 0 | 261 | False | False | False | False | False | False | 1505692800 |
| 5 | 370.0 | M | AV | M | 370.0 | GHI000100619 | RA | 25-29 | C | 0 | Adult | AV | 5.913503 | 5.913503 | 1.0 | 2017 | 8 | 32 | 11 | 4 | 223 | False | False | False | False | False | False | 1502409600 |
| 6 | 590.0 | F | AV | M | 590.0 | GHI000077859 | RA | 25-29 | O | 0 | Adult | AV | 6.380123 | 6.380123 | 1.0 | 2020 | 9 | 38 | 16 | 2 | 260 | False | False | False | False | False | False | 1600214400 |
| 7 | 210.0 | F | AV | M | 210.0 | GHI000112669 | RA | 50-54 | O | 0 | Old | AV | 5.347108 | 5.347108 | 1.0 | 2018 | 1 | 4 | 24 | 2 | 24 | False | False | False | False | False | False | 1516752000 |
| 8 | 680.0 | F | AV | M | 680.0 | GHI000307469 | RA | 30-34 | O | 0 | MidAge | AV | 6.522093 | 6.522093 | 1.0 | 2018 | 12 | 1 | 31 | 0 | 365 | True | False | True | False | True | False | 1546214400 |
| 9 | 220.0 | M | AV | M | 220.0 | GHI000612397 | RA | 65+ | AM | 0 | Old | AV | 5.393628 | 5.393628 | 1.0 | 2019 | 6 | 23 | 7 | 4 | 158 | False | False | False | False | False | False | 1559865600 |
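The date-part columns in the rows above appear to describe a single date per row, with Elapsed holding that date as a Unix timestamp. The sketch below checks this reading against two of the rows. It assumes Elapsed is seconds since the epoch in UTC, Week is the ISO week number, and Dayofweek counts from Monday = 0; the function name `date_parts` is illustrative and not part of the report.

```python
from datetime import datetime, timezone

def date_parts(elapsed_seconds):
    """Decode a Unix timestamp (assumed UTC seconds) into date-part features."""
    d = datetime.fromtimestamp(elapsed_seconds, tz=timezone.utc)
    return {
        "Year": d.year,
        "Month": d.month,
        "Week": d.isocalendar()[1],       # ISO week number (assumption)
        "Day": d.day,
        "Dayofweek": d.weekday(),         # Monday = 0 (assumption)
        "Dayofyear": d.timetuple().tm_yday,
    }

row_0 = date_parts(1602806400)  # Elapsed value of row 0
row_8 = date_parts(1546214400)  # Elapsed value of row 8
print(row_0)
print(row_8)
```

Row 8 is the interesting case: 31 December 2018 falls in ISO week 1 of the following year, which matches the Week value of 1 shown for that row.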
Last rows
| | Applied | Gender | Payment_Method | Location | Received | Id | Reason | Age | Area | True_False | AgeGroup | Payment_Type | logapplied | logreceived | Ratio | Year | Month | Week | Day | Dayofweek | Dayofyear | Is_month_end | Is_month_start | Is_quarter_end | Is_quarter_start | Is_year_end | Is_year_start | Elapsed |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 114134 | 230.0 | M | AV | O | 230.00 | GHI000077856 | RA | 50-54 | AM | 0 | Old | AV | 5.438079 | 5.438079 | 1.000000 | 2018 | 12 | 52 | 24 | 0 | 358 | False | False | False | False | False | False | 1545609600 |
| 114135 | 290.0 | F | AV | NE | 290.00 | GHI001459210 | RA | 65+ | S | 0 | Old | AV | 5.669881 | 5.669881 | 1.000000 | 2019 | 7 | 28 | 11 | 3 | 192 | False | False | False | False | False | False | 1562803200 |
| 114136 | 260.0 | F | RP | M | 260.00 | GHI001324033 | RA | 35-39 | S | 0 | MidAge | RPU | 5.560682 | 5.560682 | 1.000000 | 2017 | 4 | 17 | 28 | 4 | 118 | False | False | False | False | False | False | 1493337600 |
| 114137 | 580.0 | M | AV | M | 580.00 | GHI000088371 | RA | 25-29 | AM | 0 | Adult | AV | 6.363028 | 6.363028 | 1.000000 | 2018 | 12 | 50 | 14 | 4 | 348 | False | False | False | False | False | False | 1544745600 |
| 114138 | 106.0 | M | AV | PP | 106.00 | GHI000112669 | RA | 60-64 | O | 0 | Old | AV | 4.663439 | 4.663439 | 1.000000 | 2018 | 11 | 48 | 27 | 1 | 331 | False | False | False | False | False | False | 1543276800 |
| 114139 | 210.0 | F | AV | M | 210.00 | GHI001304576 | RA | 25-29 | BP | 0 | Adult | AV | 5.347108 | 5.347108 | 1.000000 | 2017 | 9 | 36 | 8 | 4 | 251 | False | False | False | False | False | False | 1504828800 |
| 114140 | 480.0 | M | AV | M | 480.00 | GHI000222992 | RA | 20-24 | EC | 0 | Adult | AV | 6.173786 | 6.173786 | 1.000000 | 2020 | 10 | 44 | 27 | 1 | 301 | False | False | False | False | False | False | 1603756800 |
| 114141 | 344.0 | F | AV | O | 344.12 | GHI000722944 | RA | 35-39 | AM | 0 | MidAge | AV | 5.840642 | 5.840990 | 1.000349 | 2017 | 2 | 9 | 28 | 1 | 59 | True | False | False | False | False | False | 1488240000 |
| 114142 | 220.0 | F | AV | NE | 220.00 | GHI001224192 | RA | 50-54 | W | 0 | Old | AV | 5.393628 | 5.393628 | 1.000000 | 2017 | 8 | 35 | 28 | 0 | 240 | False | False | False | False | False | False | 1503878400 |
| 114143 | 226.0 | F | AV | PP | 226.00 | GHI000076617 | RA | 25-29 | O | 0 | Adult | AV | 5.420535 | 5.420535 | 1.000000 | 2019 | 10 | 44 | 30 | 2 | 303 | False | False | False | False | False | False | 1572393600 |
Most frequent
| | Applied | Gender | Payment_Method | Location | Received | Id | Reason | Age | Area | True_False | AgeGroup | Payment_Type | logapplied | logreceived | Ratio | Year | Month | Week | Day | Dayofweek | Dayofyear | Is_month_end | Is_month_start | Is_quarter_end | Is_quarter_start | Is_year_end | Is_year_start | Elapsed | count |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 911 | 226.0 | F | AV | M | 226.0 | GHI000112669 | RA | 25-29 | O | 0 | Adult | AV | 5.420535 | 5.420535 | 1.0 | 2019 | 8 | 34 | 19 | 0 | 231 | False | False | False | False | False | False | 1566172800 | 8 |
| 895 | 226.0 | F | AV | M | 226.0 | GHI000112669 | RA | 20-24 | O | 0 | Adult | AV | 5.420535 | 5.420535 | 1.0 | 2020 | 3 | 12 | 16 | 0 | 76 | False | False | False | False | False | False | 1584316800 | 6 |
| 934 | 226.0 | F | AV | M | 226.0 | GHI000112669 | RA | 25-29 | O | 0 | Adult | AV | 5.420535 | 5.420535 | 1.0 | 2019 | 12 | 52 | 23 | 0 | 357 | False | False | False | False | False | False | 1577059200 | 6 |
| 961 | 226.0 | F | AV | M | 226.0 | GHI000112669 | RA | 30-34 | O | 0 | MidAge | AV | 5.420535 | 5.420535 | 1.0 | 2019 | 8 | 31 | 1 | 3 | 213 | False | True | False | False | False | False | 1564617600 | 6 |
| 281 | 208.0 | F | AV | M | 208.0 | GHI000112669 | RA | 20-24 | O | 0 | Adult | AV | 5.337538 | 5.337538 | 1.0 | 2016 | 12 | 49 | 8 | 3 | 343 | False | False | False | False | False | False | 1481155200 | 5 |
| 283 | 208.0 | F | AV | M | 208.0 | GHI000112669 | RA | 20-24 | O | 0 | Adult | AV | 5.337538 | 5.337538 | 1.0 | 2016 | 12 | 51 | 22 | 3 | 357 | False | False | False | False | False | False | 1482364800 | 5 |
| 299 | 208.0 | F | AV | M | 208.0 | GHI000112669 | RA | 20-24 | O | 0 | Adult | AV | 5.337538 | 5.337538 | 1.0 | 2017 | 3 | 12 | 20 | 0 | 79 | False | False | False | False | False | False | 1489968000 | 5 |
| 304 | 208.0 | F | AV | M | 208.0 | GHI000112669 | RA | 25-29 | O | 0 | Adult | AV | 5.337538 | 5.337538 | 1.0 | 2016 | 12 | 49 | 7 | 2 | 342 | False | False | False | False | False | False | 1481068800 | 5 |
| 458 | 210.0 | F | AV | M | 210.0 | GHI000112669 | RA | 20-24 | O | 0 | Adult | AV | 5.347108 | 5.347108 | 1.0 | 2017 | 11 | 46 | 14 | 1 | 318 | False | False | False | False | False | False | 1510617600 | 5 |
| 507 | 210.0 | F | AV | M | 210.0 | GHI000112669 | RA | 25-29 | O | 0 | Adult | AV | 5.347108 | 5.347108 | 1.0 | 2017 | 10 | 44 | 31 | 1 | 304 | True | False | False | False | False | False | 1509408000 | 5 |